Optical Character Recognition from Text Image
نویسندگان
چکیده
Optical Character Recognition (OCR) is a system that provides a full alphanumeric recognition of printed or handwritten characters by simply scanning the text image. OCR system interprets the printed or handwritten characters image and converts it into corresponding editable text document. The text image is divided into regions by isolating each line, then individual characters with spaces. After character extraction, the texture and topological features like corner points, features of different regions, ratio of character area and convex area of all characters of text image are calculated. Previously features of each uppercase and lowercase letter, digit, and symbols are stored as a template. Based on the texture and topological features, the system recognizes the exact character using feature matching between the extracted character and the template of all characters as a measure of similarity.
منابع مشابه
OCR for printed Kannada text to Machine editable format using Database approach
This paper describes an Optical Character Recognition (OCR) system for printed text documents in Kannada, a South Indian language. The proposed OCR system for the recognition of printed Kannada text, which can handle all types of Kannada characters. The system first extracts image of Kannada scripts, then from the image to line segmentation then segments the words into sub-character level piece...
متن کاملAn Optical Character Recognition System from Printed Text and Text Image using Adaptive Neuro Fuzzy Inference SystemAn Optical Character Recognition System from Printed Text and Text Image using Adaptive Neuro Fuzzy Inference System
This is the age of digital systems. Now a days, everything is being computerized. Peoples are using mobile phones, laptop, computer, camera, notebook, pdf reader etc digital systems too much than ever. Use of papers and pen, printed books are decreasing. Rather peoples are using digital means of communication, study, documentation. Optical character recognition is an application of these digita...
متن کاملOptical Character Recognition: A Review
The Optical Character Recognition is the electronic conversion of image of typewritten or printed text into machine-encoded text. It is common method of digitizing printed texts. Advantages being easy storage, edit ability, searching, etc. OCR is a field of research in pattern recognition, artificial intelligence and computer vision. In previous decades it has gain more importance due to feasib...
متن کاملAn Approach to GUI Identification for Printed Gurumukhi and English Text
Optical Character Recognition system is used to recognize printed and handwritten alphanumeric text from input image. A numerous of methods have been published based on optical character recognition. In proposed work expansion of optical character recognition to recognize multi-scripts is done which in infancy. Such type of expansion is crucial in India where each state has diverse language. Th...
متن کاملImage preprocessing for optical character recognition using neural networks
Primary task of this master’s thesis is to create a theoretical and practical basis of preprocessing of printed text for optical character recognition using forward-feed neural networks. Demonstration application was created and its parameters were set according to results of realized experiments. Project definition and task determination 1. Write a introduction about the problematics of optica...
متن کامل